How to Round Subspaces: A New Spectral Clustering Algorithm
نویسنده
چکیده
A basic problem in spectral clustering is the following. If a solution obtained from the spectral relaxation is close to an integral solution, is it possible to find this integral solution even though they might be in completely different basis? In this paper, we propose a new spectral clustering algorithm. It can recover a k-partition such that the subspace corresponding to the span of its indicator vectors is O( √ OPT) close to the original subspace in spectral norm with OPT being the minimum possible (OPT ≤ 1 always). Moreover our algorithm does not impose any restriction on the cluster sizes. Previously, no algorithm was known which could find a k-partition closer than o(k · OPT). We present two applications for our algorithm. First one finds a disjoint union of bounded degree expanders which approximate a given graph in spectral norm. The second one is for approximating the sparsest k-partition in a graph where each cluster have expansion at most φk provided φk ≤ O(λk+1) where λk+1 is the (k + 1) eigenvalue of Laplacian matrix. This significantly improves upon the previous algorithms, which required φk ≤ O(λk+1/k).
منابع مشابه
Innovation Pursuit: A New Approach to the Subspace Clustering Problem
This paper presents a new scalable approach, termed Innovation Pursuit (iPursuit), to the problem of subspace clustering. iPursuit rests on a new geometrical idea whereby each subspace is identified based on its novelty with respect to the other subspaces. The subspaces are identified consecutively by solving a series of simple linear optimization problems, each searching for a direction of inn...
متن کاملFiltrated Algebraic Subspace Clustering
Subspace clustering is the problem of clustering data that lie close to a union of linear subspaces. Existing algebraic subspace clustering methods are based on fitting the data with an algebraic variety and decomposing this variety into its constituent subspaces. Such methods are well suited to the case of a known number of subspaces of known and equal dimensions, where a single polynomial van...
متن کاملSubspace Clustering via New Low-Rank Model with Discrete Group Structure Constraint
We propose a new subspace clustering model to segment data which is drawn from multiple linear or affine subspaces. Unlike the well-known sparse subspace clustering (SSC) and low-rank representation (LRR) which transfer the subspace clustering problem into two steps’ algorithm including building the affinity matrix and spectral clustering, our proposed model directly learns the different subspa...
متن کاملSpectral Curvature Clustering for Hybrid Linear Modeling A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY
The problem of Hybrid Linear Modeling (HLM) is to model and segment data us-ing a mixture of affine subspaces. Many algorithms have been proposed to solve thisproblem, however, probabilistic analysis of their performance is missing. In this the-sis we develop the Spectral Curvature Clustering (SCC) algorithm as a combinationof Govindu’s multi-way spectral clustering framework (C...
متن کاملar X iv : 0 81 0 . 37 24 v 2 [ st at . M L ] 1 5 Ja n 20 09 Foundations of a Multi - way Spectral Clustering Framework for Hybrid Linear Modeling ∗
The problem of Hybrid Linear Modeling (HLM) is to model and segment data using a mixture of affine subspaces. Different strategies have been proposed to solve this problem, however, rigorous analysis justifying their performance is missing. This paper suggests the Theoretical Spectral Curvature Clustering (TSCC) algorithm for solving the HLM problem, and provides careful analysis to justify it....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016